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Expression of hydrophobic proteins 



The invention is described in the following statement: 



141358971 



15 



20 



25 



30 



Field of th e Invention- 

This invention rehdes to the expression „ f non . native (je heter0 , e 
wh.c compnae a proportion of hydrophobic ammo acids in an expression sylmTu* * 
abac te na.hos,<eg£c 0 /,,. m particular, the ,nven«on prevides a method for designing 
cand,da,e pdyepitope pdypepddes with a, incraasad P robabU ity of being efficiency 
expressed (.em amounts detected by SDS-PAGB). One parbcu.ar appbcabon of the 
mvanbon relates ,0 the producdon of a polyapitopa polypepbde comprising 26 CTL 

^ "Cn-'^unaraaponaaforthapravanaonofdisaasasasaociatadwithEBV. 
Backgroun d of the | nvgnt|an: 

M co„ ! r K to i r m ' te Pr ° dUai0n °' reCOmb " ,ant * * bacteria, host (ag 

M con, a number of partem can ba considered including factors aifecdng transcripdon 
feg P-moter choice, etc, and facto, affecdng toansladon mechanisms such as minimi 
*r« of rare codons. However, these are unlihely to have an impact on the p^ducZf 

^mbmntpolypepHdescomprlsings.retchesofhydrophobicaminoacids.vvmchhave 
^n-"ypcov m d^cuU to p ro d„ rem ^ te ^ 1Wo „ « 

*ese hydn,phob,c sequences generally improves yields of me recombinant molecule 
ffracae,a.,1999;Hobmanatal. 19H -P*ea,al, 1 991 ;B M BLwebsite . protelnt0)d • 
www.embl-heidelbarg.de/Extemallnfo). emtoxicrty. 

The most likely reason for proems occurring in the producdon of foreign 
polypepbdes possessing regions of hydrophobic!^ (particmarly those with „„ n . native 
sequences such aa fusion proteins), is thepost-trandabona! aasociabon of nascent 
polypepbdes with chaperons proteins (ag M co,i gre-EL). GroEL is involved in the 
refoidmg precess of polypepbdes emerging from fhe ribosome and proteins wi U recyde 
through the chaperoneayatem und! me correct conformadon is achieved or thepretein is 
^.edfordegredahon. CreEL Is Known to bind hydrophobic amino acids an! .part", 

the refolding process is essentially to bury these hvdrrmlw,,„ „ 
flL y " uui y mes e Hydrophobic sequences w thin the interior 

Richarme and Kohiyama, 1994; Hendridc and Hard, 1995; Lin et al 1995) 
Polyepitope or "polytope" co^^ (ie 

aodseq„encesof,tora X amp 1 e,lto5aminoadds i n I engd,,,„ OUIdbeexpecttdt !™ n ° 
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inherently unable to internaUse any hydrophobic regions as they are not nahrraily- 

occurnngsequencesandiackthefoldingcapabiliHesinbuiltinnahveproteins Hence 
polypeptides which consist ofnon-nadve sequences. particularly those with a high 

"-^Mrophobicaminoadds.arehkdytobeseques.eredinAechapeLe 
foldmg pathway and ultimate,,, targeted for degradation if a certain degree of 
conformational stability cannot be achieved. 

a tande^"? **** ~ " ~" P-W*. -* made up of 

ZTZZ — r 01 ^ ^^pes.partfcularlymoseofL 
HLA A2 type, often compnse predominantly hydrophobic amino acids and since HLA A2 

> ^^'"-"^of^humartpopulauonftian^^^,^;^^ 
^""y^vepofyepib^va^^^ 
vaccmes are described ,„ Auslralian Palent No . ^ fc ^ 

beregardedasbetngincorporatedhereinbyreference. In this patent, vaccines a re 
^« which compdseasyntheHcor recombinant polypeptide, or a recombinant 

whtchthemferveningsequencesdonot (l)comprlse meWomne or <„> comprts! na^l 
—gflankingsequencesof me epitopes. Particularly described in mepriorp^I 
va<-e.comp riting ap 0l y^ topeva ^ iavirus apoIyepitopep ^ ' J"" 

compnsmg 9 CTL epitopes (each of, to „ amino acids m fength, fLnEBV. Ld^d 
d_ , ease assays ^ ^ vjrus ^ a ^ ^ 

HLA alleles for restriction of each epitope and using autologous CTL clones specific for 
each epnope as effector ceUs. showed tha, each epitope coujd be efndendy processed!™ 
*e polyepnope po.ypepbde since. ,„ aj, cases, me CTL Cones racogntsed and M ,ed the 

^matched ^« ce,. Wected w,m me polyep ito pe vac.ma v,rut but did not ^ 
of the negative controls (ie TK-vaccinia). 7 

soma,l!^rrT7r yindudeatogen ™ tero,m ^ ito P es ^ 10 -^ 

sothatmeHLAdrversttyo/thatergetpopulationiscovared. It has therefore been 

contemplated by the present applicant to produce an EBV „„, 

includes FBV «.> . ""produce an BBV polyepttope vaccine which 

TZ Z 2 S ^ ** A "' A23 ' «*• B7 < ■* "27, B35. B44, B46, 

populabon. Th,s would Involve the incorporation of abou t26 EBV CTL epitopes into a 
^epltopepolypepHde.Pormereason.glvenabove.ftwasexpec^Zr; 
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polypeptide would contain hydrophobic regions and that expression in a bacterial host 
would be highly problematical. 

The work leading to the present invention was aimed at elucidating a method or 
procedure for overcoming the difficulties of expressing non-native, polypeptides which 
comprise a proportion of hydrophobic amino acids (eg polyepitope polypeptides) in a 
bacterial host such as R coll The present applicants have, as a result of that work, 
identified a novel method for designing candidate polyepitope polypeptides with an 
increased probability of being efficiently expressed in a bacterial host and/or yielding a 
punned polyepitope polypeptide which is soluble in aqueous solutions. The method 
mvolves identifying one or more hydrophobic peptide sequences within a polypeptide and 
arranging or re-locating at least one of the hydrophobic peptide sequenced), so as to- (a) 
reduce or minimise amplitude (ie peaks) in hydrophobic^ across the length of the 
polypeptide, and/or (b) reduce or minimise the total length of any hydrophobic region(s) 
within the polypeptide. 



15 



Summary of the Invention: 

In a first aspect, the present invention provides a method for designing a candidate 
polypeptide for expression in a bacterial host, said method comprising, 

identifying one or more hydrophobic peptide sequences within a polypeptide of 
20 interest and 

arranging or re-locating at least one of said hydrophobic peptide sequences within 
sa ld polypeptide so as to generate said candidate polypeptide with reduced or minimised 
amplitude in hydrophobic^ and/or length of any hydrophobic region(s). 

Preferably, the polypeptide of interest is non-native to the intended bacterial host 
Smce the most preferred bacterial host is M coll most preferably the polypeptide is non- 
native to B. coll 

The polypeptide of interest will preferably be a non-natural polypeptide or even a 
theoretical non-natural polypeptide (ie a polypeptide yet to be synthesised or expressed) 
compnsing a plurality of amino acid sequences of interest some of which may be 
hydrophobic suspected to be hydrophobic, and which has been found not to be or is 
suspected not to be, efficiently expressed in a bacterial host. For such a polypeptide of 
interest, the method of the first aspect provides the possibility of identifying one or more 
hydrophobic peptide sequences, if any, within the polypeptide of interest and arranging or 
re-locating at least one of the hydrophobic peptide sequence^) so as to generate a 
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candidate polypeptide with reduced or minimised amplitude in hydrophobicity and/or 
length of any hydrophobic region(s), and therefore an increased probability of being 
efficiently expressed in a bacterial host. & 

Preferably, the polypeptide of interest may be a synthesised or theoretical 
polyepitope polypeptide comprising a tandem array of epitopes of interest (eg CTL 
epitopes, which, as is mentioned above, often predominantly comprise hydrophobic amino 
acids). For such a polypeptide of interest, the method of the first aspect permits the design 
of candidate polyepitope polypeptides comprising a large number of epitopes of interest 
(eg 10 to 35 or more) with an increased probability of being efficiently expressed in a 
bacterial host, by enabling the possibility of identifying one or more hydrophobic epitopes 
andarranging or re-locating atleast one of the hydrophobic e P ito P e( s) , so as to generate a 
candidate polyepitope polypeptide with reduced or minimised amplitude in 
hydrophobicity and/or length of any hydrophobic region(s). 

It has been found that the method of the first aspect is best applied to the design of 
a candidate polyepitope polypeptide in a manner which identifies and ranks the relative 
hydrophobicity of each of the selected epitopes (nb The epitopes of interest maybe a range 
of epitopes fromasing I epathogen(eg E BV) selected to provide a polyepitope polypeptide 
that covers the HLA diversity of the target population. The epitopesof interest may lobe 
one or more epitopes from a range of pathogens or the epitopes maybe derived from a 
non-microbial source such as a tumour cell for treating or preventing cancer.), groups the 
ranked epitopes into three groups of substantially equivalent numbers, based upon the 
identified relative hydrophobicity (ie so as to produce the groups, Group 1 = most 
hydrophobic, Group 2 = middle hydrophobicity, and Group 3 = least hydrophobic and 
"resxduaT epitopes where the total number of epitopes is not wholly divisible by 3), and 
then arranges the epitopes into triplets where the triplets contain an epitope from each 
group ( ie three linked epitopes; epitope 1 - epitope 2 - epitope 3) and arranged into a 
candidate polyepitope polypeptide having the formula, Triplet 1 - Tripjet 2 - .... - Triplet N 
as follows: r 
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Triplet 1 (N- 
terminal) 
Triplet 2 



Epitope 1 



most hydrophilic of 
Group 2 

2 nd most hydrophilic 
of Group 2 



Triplet N(C- 
terminal) 



most hydrophobic of 
Group 2 



Epitope 2 



most hydrophobic of 
Group 1 
2 nd most 

hydrophobic of 
Group 1 . 



most hydrophilic of 
Group 1 



Epitope 3 



Most hydrophilic of 
Group 3 
2 nd most hydrophilic 
of Group 3 



Most hydrophobic 
of Group 3 



25 



Between the epitope triplets, or between any or all of the epitopes within a triplet, 
there m ay be intervening sequences (preferably short sequences of 1 to 10 amino acids) 
which may optionally be hydrophilic (eg lysine-lysine) so as to reduce any local peaks of 
hydrophobic^. Preferably, the epitopes within a triplet are contiguous. 

°* er sim P Ie ™*ods ^ arranging the epitope(s) so as to minimise extremes in 
hydrophobacity in a polyepitope polypeptide will be readily apparent to persons skilled in 
the art, and are to be considered as forming part of the present invention (eg polyepitope 

poypeptide sequences based on sets of 4 epitopes could be constructed in a simOar manner 
to that described above for triplets). 

Once a candidate polypeptide has been designed in accordance with the method of 
thehrstaspectapolynu^^ 

according to any of the methods well known to persons skilled in the art. The encoding 
polynucleotide maybe incorporated into, for example, vector, such as viral vectors (eg 
vaccinia to provide a recombinant polyepitope viral vaccine) or expression vectors such as 
those suitable for expression in a bacterial host. " 

Thus, in a second aspect, the present invention provides a method* expressing a 
polypeptide in a bacterial host, said method comprising, 

designing a polypeptide in accordance with the method of the first aspect 

such * T d T S 3 ^ ynUCle ° tide encodi "* Said Peptide into said bacteria, host, 
such that sa ld bactenal host is capable of expressing said polypeptide, and 
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culturing said bacterial host under conditions suitable for expression of said 
polypeptide. 

The expressed polypeptide may be isolated by, for example, lysing the bacterial 
host and purifying the polypeptide from the produced cell lysate. 

The polynucleotide introduced into the bacterial host may encode the polypeptide 
in the form of a fusion of the polypeptide with a suitable carrier protein. Alternatively, the 
polypeptide could be expressed and subsequently linked to or otherwise associated with a 
suitable carrier protein. Suitable carrier proteins are well known to persons skilled in the 
art and include p-galactosidase, glutathione S-transferase and the g P 350 structural protein 
from EBV or a fragment thereof. The carrier protein may comprise additional useful 

epitopes. Further increases in expression benefits provided by ordering may be conferred 
by the carrier protein. 

In a third aspect, the present invention provides a polypeptide designed in 
accordance with the method of the firet aspect. 

If desired, the polypeptide of the third aspect may be in the form of a fusion of the 
polypeptide with a suitable carrier protein. 

In a fourth aspect, the present invention provides a polyepitope polypeptide 
designed in accordance with the method of the first aspect. 

If desired, the polypeptide of the fourth aspect may be in the form of a fusion of the 
0 polypeptide with a suitable carrier protein. 

In a fifth aspect, the present invention provides a polyepitope polypeptide 
comprising N epitopes, wherein N is any integer preferably in the range of 5 to 50 
(preferably, 10 to 35), said polyepitope polypeptide having the formula; 

Triplet 1 - Triplet 2 - .... - Triplet N, 
wherein each of said triplets comprises three linked epitopes selected by, 

identifying and ranking the relative hydrophobicity of each of the N epitopes, 
grouping the ranked N epitopes into three groups of substantially equivalent 
numbers, based upon the identified relative hydrophobicity of the N epitopes, to produce a 
first group (ie Group 1) comprising the most hydrophobic epitopes, a second group (ie 
Group 2) comprising the epitopes having a middle level of hydrophobicity, and a third 
group (,e Group 3) comprising the least hydrophobic epitopes, and 

selecting the epitopes for each of said triplets according to the following table: 



141358971 



3 



Triplet 1 (N- 
terminal) • 



Triplet 2 



Epitope 1 

most hydrophilic of 
Group 2 

2 nd most hydrophilic 
of Group 2 



Triplet N(C- 
terminal) 



most hydrophobic of 
Group 2 



Epitope 2 

most hydrophobic of 
Group 1 
2 nd most 
hydrophobic of 
Group 1 




most hydrophilic of 
Group 1 



Epitope 3 



Most hydrophilic of 
Group 3 

2 nd most hydrophilic 



of Group 3 



Most hydrophobic 
of Group 3 



«*J^T r w flret ' 3Kond and **- ^ x ,denarai ■>-*- - 

.p. to pes.Whe reNls an integer no, whoUy divide by3(ie an in«e ger other tt^te 
^^-^^andW.^.he^doaiep.^^p^^^^^ 
rmra group. 

anv te» Al * e ^ 0tSeleainS e, " tOPeS for «"* ° f «" «F"* « *« Ware ' 

added to n ,eC-ter ral na 1 o f Tr i p,e t Horo^ may o. IO ca te d within Jcand-Z 
poiyepnope polypeptide sequence so as to reduce any iocai peaks of hydrophobic!*. 

Between the.epitope Wptets, or behveen any or tf of the epitopes within a triple,, 
^^ ta ^^s« n ,e„ce,(p^ 1 y shM ^ u ^ rfltol0aBi ^^ 

hydrophobtcty. Preferably, the epitopes within a trip.e, are contiguous 

« des.red, toe po^itope polypepHde of toe fifth aspect ntay be in toeforn, of a 

fusion of the polyepitope polypeptide with a suitable carder protein 

in a sixth aspect, toe present invention provides a polyepitope vaccine co mprisin . a 

acceptable carrier and/or adjuvant. 7 

In a seventh aspect, the present invention provides.a polyepitope polypeptide 
compnsmg an amino acid sequence substantially corresponding to: 

FLRGRAYGL - PYLFWLAAI - HRCQAIRKK - RPJYDLEBL - VQPPQLTLQV- 
GLCTLVAML - RLRAEAQVK -IEDPPFNSL -YLLEMLWRL - GQGGSPTAM - 
A VLLHEESM - IALYLQQNWWTL-RAKFKQll - SSCSSCPLSKI- TYGPVFMCL- 
QAKWRLQTL- RPPIFIRRL- VSFIEFVGW - YPLHEQHGM -VEITPYKPTW - 
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CLGGLLTMV - EENLLDFVRF -' TYSAGIVQI - LLDFVRFMGV -EGGVGWRHW 
(SEQIDNOrl); 



or 



FLRGRAYGL- PYLFWL AAI - HRCQAIRKK - RRIYDLIEL- GLCTLVAML- 
5 RLRAEAQVK- IEDPPFNSL -TYSAGIVQI- LLDFVRFMGV- EGGVGWRHW- 

IALYLQQNWWTL- RAKFKQLL - SSCSSCPLSKI- TYGPVFMCL- QAKWRLQTL- 
RPPIFIRRL - VSFIEFVGW -YPLHEQHGM- VEITPYKPTVV- CLGGLLTMV- 
EENLLDFVRF - YLLEMLWRL - GQGGSPTAM - AVLLHEESM -VQPPQLTLQV 
(SEQIDNO:2). V 

10 

In an eighth aspect, the present invention provides a polyepitope vaccine 
comprising a polyepitope polypeptide according to the seventh aspect and a 
pharmaceutical^ acceptable carrier and/or adjuvant. 

Detailed Description of th» invention. 

The present applicants have identified novel methods for designing a candidate 

polyepitope polypeptide, with an increased probability of being efficiently expressed in a 

bactenal host (ie in amounts detectable by SDS-PAGE). The method involves identifying 

one or more hydrophobic epitope(s) and arranging or re-locating at least one of the 

hydrophobic epitope(s) so as to generate a candidate polyepitope polypeptide with 

reduced or minimised amplitude in hydrophobic^ and/or length of any hydrophobic 
region(s). 9 J r 

An algorithm to calculate hydrophobic^ values of amino acid sequences and 
subsequendy arrange sequences to; (a) reduce or minimise amplitude* hydrophobic^, 
and/or (b, reduce or minimise the length of hydrophobic sequences, was genereted and 
applied to 26 CTL epitope from ^ ^ ^ fc ^ ^ 

cand,da,e polyepitope polypeptides (designated PT26A and PT26B, described hereinafter) 
one of which proved to be efficiently expressed in E cM Ihe expreased polyepitope 
polypeptide shows promise as me basis of an EBV vaccine for prevention or treatment of 
mfecbous mononucleosis and/or EBV-related cancers such as Burld«s lymphoma. 
Hodg.onsdise.se, non-Hodg^s .ymphoma. naao-pharyngea! carcinoma, gastric 
adenocarcinoma, lymphoma, associated with immunosupresaion, lymphoepHhelioma-like 
carcinomas, and immunodeficiency-related leiomyosarcoma. 
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Whllel °°"ng for an explanation as to why m^^^ 
the two similar candidate polyepitope polypeptides were different, summations of 
hydrophobicity values (designated Hydrophobic Index (HI) values) were calculated for 
different numbers of epitopes over the length of the candidate polyepitope polypeptides to 
5 identify local areas of hydrophobicity. Summation over 3 and 4 epitopes showed that there 
were regions in the non-expressed polypeptide where the HI value was higher than in the 
expressed polypeptide. This information enabled the identification of a threshold HI 
value, such that polypeptide sequences which comprised a region with an HI value in 
excess of the threshold value, could be predicted as being less likely of being efficiently 
10 expressed in a bacterial host. 

Thus, in a preferred embodiment of the methods of present invention for designing 
candidate polypeptides, the methods involve initially calculating hydrophobicity values 
and arranging peptide sequences to; (a) reduce or minimise amplitude in hydrophobicity 
and ^-e°r-immise^ | f 

15 necessary, by calculating mem values over different peptide sequence groups, thus ' 
providing numerical values for comparison and prediction of the likelihood of a candidate 
polypeptide sequence being efficiently expressed in a bacterial host. So, in applying this 

preferred embodiment to the design of a candidate polyepitope polypeptide, the method 
involves: 

20 (i) Calculating the hydrophobic value for each epitope using a suitable algorithm 
Fauschere and Pliska, 1983 contained within the software package "Pinsoft T from 
Mimotopes Pty Ltd, Clayton, Victoria). 

(ii) Ranking the set of epitopes in order of decreasing hydrophobicity 

(iii) Dividing the rank ordered set of epitopes into 3 equal groups (ie group 1 = most 
-5 hydrophobic, group 2 = middle hydrophobicity and group 3 = least hydrophobic (most 

hydrophilic)), and including any residual epitopes (ie epitopes left over after the set is 
divided by 3) in group 3 (ie the most hydrophilic group). 

(iv) Creating triplets of epitopes by taking the most hydrophilic of group 2 (ie last in 
group 2 ), then the most hydrophobic epitope (ie number 1 in group 1) and lastly the most 
hydrophilic (ie last in group 3) until all epitopes in groups 1 and 2 have been used (nb 
Leftover" epitopes are handled as set out in step (ix) below). 

(v) Arranging the triplets into a sequence in the order in which they were produced (ie 
Triplet 1 - Triplet 2 - Triplet 3 - etc). 

(vi) Plotting the hydrophobicity of the arranged triplet sequence using a suitable 



0 
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algorithm (eg Pausehere and F» ska , 1983 , or Hopp ^ Woods lg8j 

steps (iii) and (iv) above) y 38 3 Sequence of 

The HI va lues may be cajculaeed by using the n^^, 



e=m+n-l 
HI m =Zx c 



20 

e=m 



25 



30 



(where m = group number, n = group size, e = integer (eoiton.. „ u 
hydophobiciry value). nte e e r (ep.tope) number, x = epitope 

abie to expir^Z ITT *" ™ d *- - be 

needtobelessthanLgMxim^/ . P " 0pC s «> uen< « 

ne present mventjon therefore further 
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extends to a method of predicting efficient expression * . poIypepHde in . 
mvolvmg calculating HI values in accordance with the mathematical expression: 

e=m+n-l 
HI m = Sx e 



e=m 
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^ erem = ^ u P^er,n = groups^ 

hydophobidty value). v 

The methods of the present invention permit the design of candidate polyepitope 
polypeptides comprising a large number ot epitopes of interest (eg 10 to 35 or more) with 

anmcreasedprobabili.yofbeingefficientiyexpressedinab.c.enaihost. The epitopes of 
mterest may be a range of epitopes from a single pathogen selected to provide a 
polyepitope po.ypeptide that covers the HLA diversity „, the target population, or the 
epitopes of interest may be one or more epitopes from a range of pathogens or tumour 
antigens. As is evident from the above, one particular application of the methods of the 
P^enUnvention ta to the des!g„ or candidate polyepitope po,ypeptides comprising 26 
EBV CTL epitopes for use in a vaccine to provide protection against EBV to over 90* of the 

... to the des,gn of candidate po.yepi.ope polypeptides comprising CTL epitopes from 
cytomegalovirus (CMV). for use in a vaccine to prevent or treat CMV-causative diseases 
A candidate polypeptide designed in accordance with the methods of the present 
mvention may be expressed by flretiy synthesising a polynucleotide encoding the 
candidate polypeptide according to any of the methods wel! known to persona skilled in 
*e art, and then by introducing the polynucleotide into a suitable bacterial boat Typically, 
Una wr.1 be achieved by Coning the polynucleotide into an expression vector and then 
.ntroducmg the expression vector into a srutable bacteria! host by any of the transformation 
methods we" known topersonsskUled in the art. Expression from the expression vector 
may result ,n th. polypeptide being expressed as a fusion protein comprising the 
polypeptide and a suitable carrier protein (eg p-galactosldase, glutathione S-transferase or 
the gp350 structural protein from EBV or a fragment theritof). Alternatively, the 
polypeptide may be expressed by the bacterial host and following isolation of the 
polypeptide, tire polypeptide may be linked to or otherwise associated with a suitable 
carnerprotein. The carrier protein may also confer additiona. useful properties (ie the 
earner protein may comprise useful epitopes or sequences to enhance solubility, further 
enhance punhcation procedures or facilitate association with an adjuvant) 
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It ,s also contemplated that candidate polypeptides designed in accordance with the 
methods of the present invention may be readily expressed in whole cell ly Sates and non- 
bacterial host cells, as well, and accordingly such alternative expression methods for 
candidate polypeptides are to be considered as forming part of the present invention. In 
particular, the present invention is to be considered as extending to a method of expressing 
a polypeptide in a non-bacterial host cell such as a mammalian cell (eg a CHO cell or COS 
cell line), a yeast cell (eg Saccharomyces cerevisiae) or insect cell (eg SF9 cell line), wherein 
the method comprises designing a polypeptide in accordance with the method of the first 
aspect, introducing a polynucleotide encoding the polypeptide into the host cell such that 
the host cell is capable of expressing the polypeptide, and culturing the host cell under 
conditions suitable for expression of the polypeptide. The expressed polypeptide may be 
isolated from the host cell culture by lysing the cells and purifying the polypeptide from 
the produced cell lysate, or alternatively, the polypeptide could be expressed with a 
suitable secretion signal such that the polypeptide is secreted into the culture medium 
(from where it may be purified). Designing a polyepitope polypeptide in accordance with 
the methods of the present invention may also overcome non-secretion problems which are 
sometimes experienced when a hydrophobic polypeptide is expressed with a foreign 
secretion signal. 

Where the expressed polypeptide is of pharmacological or veterinary significance 
the polypeptide may be formulated into a pharmaceutical or veterinary composition 
Generally, such compositions will comprise a pharmaceutically acceptable or veterinary 
acceptable carrier, and may include other substances and excipients as may be required 

Polyepitope polypeptides may be fbimulated into vacdne compositions. Generally 
such compositions will comprise a pharmaceutically acceptable or veterinary acceptable 
earner and may include adjuvants (eg an ISCOM™ adjuvant, DEAE, polysaccharides 
liposomes and virus-like particles), and other substances and exdpients as may be 
required. For example, the vaccine compositions may include immunomodulatory 
compounds (eg cytokines), and other proteins/ compounds (eg melittin or regulatory 
proteins). The vaccine compositions may also include helper epitopes/CD4 epitopes or B- 
cell epitopes. The vaccine compositions may be adapted for administration to a subject by 
for example, intramuscular injection, nasal administration via an aerosol spray or oral 
administration. Preferably,- the vaccine compositions are ISCOM™ adjuvant compositlons 

Polyepitope polypeptides may also be administered to a subject in the form of a 
viral vaccine (eg a recombinant polyepitope vaccinia or adenovirus) orDNA vaccine 
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Thus, in a further aspect, the present invention provides a polynudeotide vaccine 
coming a polynudeotide encoding a poIypept.de designed in accordance with the 
-"^^a^^aphartnaceuncaBya^We^^^ 

Throughout this spedfication the word "comprise", or variations such as 
comprises" or •'comprising", will be understood to im ply , he fusion of a stated element 

element, integer or step, or group of elements, integers or steps. 

The term "substantially corresponding" as used herain in relation toan amino acid 
sequence ,s .mended to encompass fhe exact amino acid sequence as well as minor 
10 vanattons ^* no, ^m, ,„ a decrease ■„ the biological achvity of the amino d d 
sequencetegvariaHonswhichdono.diminish.heabi.iryofanepitopetoprt.voheaCn. 
= — smayincudeoneormoreconserla^amin: .? 

subsntuuons. The conservaHve amino add substitutions envisaged are: G.A.V.I L M- 

B.E,N.Q,S.CT ;KR .H, a ndP.Na-aJky,aminoadds 

W be • ^^"""'^^'^'"^^^^"^^"hichhas 
beentndudedmfhepresentspedncabonissoWytormepurt^ofprovfdingacontext 
orthepresenHnvenuon. It is fheretore, no, to be taicen as an adtnlssi l^ZTo, 

^marte.fbrmpac.ofmeprtorartbase or we re common genera, krtowledge ' Il 
20 pnoniydateofthepresenlspedncation. »n"mgor 

limiting examples and accompanying figures. 

grief Description of th. ^ ~ T panvln ^ f , r ,„ r 

" ™, .r^ rc ? rortdCS * eeP ' toperonfl ^ a,io " andam '"'>^^q»-cesforEBV- 
rtWrt^rm-™ Numbers above ephopes represent 

acetyl N-teminal and an amide C-terminal. 

D values ^ and PT26B. Hydrophobidty 
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Figure 4 shows Coomassie stained SDS-PAGE gels showing fh. M 
expression lowing induction with 1PTG (+IPTG or £ t^T"""* 
30*Da>, (B) gp350/Pr26A (at a Pproximately 80kDa) , and (QIT26B^ 
indicatethelocationofrecombinantprotein ^ 

^^--^^OTassayresultsofCTLresponsestothefiveHLA^ 
elopes contained within the polyepitope polypeptides, l4 A an 1 T 

f ™°" — — <a) io, g gpaso-Pr^cB) 10 „ g ™!;r " ^ 

indicatingthenu.beroflPN-yspotsproduced. ? ? P ~ " ^ 

MATERIALS AND METHODS • • OSmates, 

Epitope sequences 

The 26 CTL epitopes for inclusion in an EBV varri™ *k 
ortS i n«ea n dHLA ly pe areihownlnTablei a T ™— ' " ~ *» •** *'y 
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TABLE 1. CTL epitopes included in the EBV polytopes 
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Design /Ordering of epitopes 

The following method was used to generate ordered arrangements of CTL epitopes 
to produce a polyepitope sequence with favourable hydrophobidty characteristics: 

(i) The hydrophobic value for each epitope was calculated using a suitable algorithm 
(ie Pinsoft 2 from Mhnotopes Pty Ltd, specifying the N-terminus as N-acetyl and the C- 
terminus as carboxy amide). 

(ii) The set of epitopes was ranked in order of decreasing hydrophobidty. 

Oii) The rank ordered set of epitopes was divided into 3 equal groups (ie group 1 = most 
hydrophobic, group 2 = middle hydrophobidty and group 3 = least hydrophobic (most 
hydrophilic)). Residual epitopes (ie 2 epitopes left over after the set of 26 was divided by 
3), were included in the most hydrophilic group. . 



dv) Tnplets of epitopes were created by taking the most hydrophilic of group 2 (ie last 
m group 2), then the most hydrophobic epitope (ie number 1 in group 1) and lastly the 
most hydrophilic (ie last in group 3). This was continued until all epitopes in groups 1 and 
2 had been used (nb "Leftover" epitopes were added to the C-terminal end of the final 
20 sequence arrangement). 

(v) The triplets were then arranged into a sequence in the order in which they were 
produced (ie Triplet 1 - Triplet 2 - Triplet 3 - etc . 
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(vi) The hydrophobidty of this triplet arrangement was then plotted using a suitable 
algorithm (ie Fauschere and Piiska)). 

(vii) If and where necessary, relocating triplets from areas of low hydrophobidty into 
areas of high peak hydrophobidty in order to reduce hydrophobic amplitude. 

(viii) Re-calculating the hydrophobidty plots and continuing to shuffle triplets as in the 
step (vii) above, (nb It is possible to move individual hydrophobic (ie group 1) epitopes 
from areas of peak hydrophobidty into areas of low hydrophobidty or vice-versa, to 
further reduce peak hydrophobidty amplitude, however, this should not be necessary as 
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the ordering process above minimises peaks). • 

Ox) Any residual epitopes (ie least hydrophilic of group 3) can be placed C-terminally 
in the ftnal sequence arrangement or can be used to further reduce local peaks in 
5 hydrophobic^ by inserting them adjacent to epitopes of peak hydrophobic^, according 
to a hydrophobicity plot of the assembled triplets. 

(x) Any affinity tags (usually hydrophilic, eg a hexa-histidirie sequence) should be 
located either N- or C- terminally or preferably C-terminally if the construct is a fusion 
10 protein. . 

. (xi) Confirmation of satisfactory HI. 

The example of PT26A is shown in Table 4. 
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Using this process, EBV polyepitope configurations PT26A and PT26B 
Hydrophobic Index (HI) calculations . 



were created. 



HI values for favourable configurations (PT26A, PT26B) were calculated according 
20 to the mathematical expression:- 

e=m+n-l 
HI m -Zx e 

(where m = group number, n = group size, e = integer (epitope) number, x = epitope 
25 hydophobicity value) over values of n from 2 to 5. 

Preparation of recombmant proteins 

The DNA sequences encoding the polyepitopes (PT26A and PT26B) were generated 
from synthetic oligonucleotides using a Splicingby Overlap Extension technique (SOE) 
describedby Horton et al (1995). The codon usage was optimised for R ^/expression 
(Wada et al 1992). The polyepitopes were tagged at the C-terminus with a hexa-histidine 
tag for protein purification and detection. The DNA was subcloned into P ET28b 
(Novagen) and transformed into R «>//BL21(DE3) cells (Novagen) for expression. 
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A fragment corresponding to the N-terminal region (amino acid residues 21 - 447) 
of EBV g P 350 was amplified, from plasmid DNA containing the full length g P 350 sequence 
by PCR using the following oligonucleotides: 

5' AGGGATCCCATGGAAGATCCTGGTTTTTTC 3' (forward) and 
5' TCTAGAGGTCGACACCTGTCGTTGTATTGGG 3' (reverse). This DNA fragment was 
subcloned into P ET28b (Novagen) containing the polyepitope insert, resulting in an in- 
frame fusion between g P 350 and the polyepitope polypeptide. The constructs were . 
referred to as gp350/PT26A and gp350/PT26B. 

For protein expression testing, transformed cells were grown in 50mg/ml 
Kanamycin containing L broth at 37°C until OD600 reached 2. Protein expression was 
induced by the addition of IPTG (0.25mM) and cells were grown for another 3hr. Cells 
were pelleted and boiled in SDS sample buffer before analysing by SDS-PAGE. 

For protein purification, cells induced by IPTG were pelleted, resuspended in 
binding buffer (20mM Tris-HCl P H 7.9, 0.5M NaCl, 5mM imidazole) and then sonicated. ' 
Inclusion bodies were pelleted and washed in buffer. The proteins were solubilised 
overnight in binding buffer containing 8M urea and purified on a Ni ++ -NTA column. 

Preparation ot JSCOM™ formulations 

ISCOMATRDC™-adjuvant was prepared by combining adjuvant components in a 
formulation vessel. Cholesterol, 1,2, dipalmitoyl phosphocholine (DPPC), and 
ISCOPREP™ as source of purified Quillaja saponins, were mixed in a ratio of 1:1:5 in the 
presence of the detergent Mega-10 (United States Patent No. 5,679,354) at a concentration of 
2%. The detergent was removed by diafiltration with PBS and the formation of 
ISCOMATRIX™ confirmed by negative contrast electron microscopy revealing complexes, 
including cage like structures with a diameter of 40nm. ISCOM™-adjuvanted vaccines 
were prepared by mixing the EBV polyepitope antigen with preformed ISCOMATRIX™- 
adjuvant, which was prepared as described below: The dose strength of KCOM-adjuvant 
as saponin was quantified by reverse phase HPLC assay. 

ISCOM™ vaccines were prepared by gent le mixing at 22«C of an equal volume of 
2x final dose strength ISCOMATRDC™ with an equal volume of 2x final dose strength EBV 
polyepitope antigen (gp350-PT26A and B). After 60 minutes, the formulation was 
subjected to extensive dialysis, in order to remove urea, at 4°C into PBS buffer pH6.2 using 
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12,000 molecular weight cut off dialysis membrane (Cellu Sep T3, San Antonio TX). 

Mouse Immunogenidty 
Dosing • 

Female HLA A2 transgenic C57B1/6 mice (HDD) were bred at Queensland Institute 
of Medical Research (QIMR) and immunised at 5-7 weeks of age. Mice were housed in 
filter-topped cages in the PC3 animal facility at QIMR. Groups of 4 or 5 mice were dosed 
sub-cutaneously at the tail base with 0.1ml formulation. This was followed by removal of 
spleens at day 21 for ex- vivo ELISPOT assay (below). 

Mice dosed sub-cutaneously, received lOug ISCOMTM-adjuvant (as saponin) ^ 
lO^ig EBV polyepitope antigen. For a vaccine control group, mice were dosed sub- 
cutaneously with a peptide mixture comprising free peptides (Mimotopes Pty Ltd) 
formulated with tetanus toxoid and Montanide ISA 720 (SEPPIC, Paris, France) as 
previously described (Elliot et al, 1999). Peptide control immunisations come from two 
groups of mice, one group immunised, with GLCTLVAML/ YLLEMLWRL/ LLDFVRFMGV 

peptide mixture and the other YLQ/CLG. A mixture of all five epitopes showed some 
insolubility problems. . . 

CTL activity (ex- vivo ELISPOT) 

Ex vivo ELISPOT measures/ quantitates both effector and memory CTL which 
secrete IFNy. Pepti de-specific IFN Y secreting cells are enumerated by an enzyme linked 
l mmuno-spot(ELISpot) assay modified from Murali-Krishna etal. Flat bottomed 96-well 
microtitre plates are coated overnight with 5 ug/mL of rat anti-mouse IFNy antibody(clone 
RA-6A2, BD PharMingen, San Diego, California). Coated plates are then blocked for 1 
hour with 1% foetal FBS in PBS, and then washed three times with PBS/0.05% Tween 20 
(PBS-T), and incubated for 1 h at 37»C with medium comprising RPMI 1640, supplemented 
with 100 ug/ml streptomycin and 100 IU/ml penicillin, 10% FBS and 10 s M 2- 
mercaptoethanol. Mouse splenocytes were then treated with red blood cell lysis buffer 
washed and resuspended to 1 x l(f cells/mL in medium, for use in the ex vivoTFNy 
ELISPOT assay. Splenocytes (1 x 107 well) are then placed in the first wells of the ELISpot 
plate and serially diluted two fold. Recombinant human IL-2 (kindly provided by Cetus 
Corp. Emeryville, California) is then added to the plate at a final concentration of 5 
IU/ well together with EBV peptide (Mimotopes Pty Ltd) at a final concentration* 100 
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ug/ml. MediacontaininglL^withoutpeptideisaddedtocontrx,! wells. The final volume 
in eachwell is 100 ul. Plates are incubated at 37*C in 5% CG 2 for approximately 18 hours. 
After incubation, cells are lysed by rinsing, the plates in H 2 Q and then washed twice in PBS- 
T. B 10 unylated anti-mouse IFNy antibody (commercial source) is diluted 1:500 (2 ug/ml 
5 final concentration) in PBS-T/5% FBS and added to all wells at 50 m/ well and incubated 
hours at RT. Plates are then washed in PBS-T and streptavidin-alkaline phosphatase, 
diluted 1:400 in PBS-T/5% FBS, is added at 50 ulperwell and incubated forafurther2 
hours. After washing, plates are developed by adding Sigma Fast BCIP/NBT substrate at 
50 ul/ well. Plates are incubated at 37°C for approximately 20 minutes to allow colour 
> development, and then washed with water to stop the reacnon. IFN Y specific spots are 
counted using KS ELBPOT Reader (Zeiss). 

RESULTS 

Epitope Fusions 

26 EBV CTL epitopes were selected to provide >90% human population coverage 
for a vaccine formulation. 

In order to link these CTL epitopes (Table 1) together and facilitate the design of a 
polyepitope antigen to form the basis of an EBV prophylactic vaccine, the hydrophobic^ 
values of each epitope were calculated using Pinsoft 2 software. Two versions of 26 
epitopes were then ordered into configurations PT26A and PT26B (Figure 1) which 
mmunised peak hydrophobic^ and hydrophobic sequence length (Figure 2). When these 
constructs were cloned for expression in R coliit was found that only one of the 
- configurations (PT26A) was able to produce a polyepitope polypeptide (Figure 4A). PT26B 
was not produced (Figure 4C). 

To identify a potential reason for this unexpected finding, local areas of high 
hydrophobic^ were examined by summation of overlapping hydrophobic^ values 
(Pinsoft 2, Mimotopes Pty Ltd) to provide a hydrophobic^ index (HI) over varying 
numbersofpepudesin a group(n). For n=2, no correlation was apparent. However,for 
n=3 and n=4 (Table 2), the highest HI values for the expressed sequence (PT26A) were 
lower (1,79 and 2.51 respectively) than the highest values obtained for the non-expressed 
sequence (2.02 and 2.54 respectively). For n=5, again no significant differences were seen. 

I indiCatC ^ ^ ^ ° f h * her Mrophobicity in PT26B 

tnanPT26A. 
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The above analysis can be represented by the mathematical expression:- 

e=m+n-l 
HI m = Sx e 
e=m 

5 (where m = group number, n = group size, e = integer (epitope) number, x = epitope 
hydophobicity value). 

Overall, the results with n=3 and n=4 appeared to show the most distinctive 
differences. This would make the prediction that to be able to express linked, random, 
short amino acid sequences in & colim SDS PAGE detectable amounts, the hydrophobic 
10 index over groups of three epitope sequences would need to be less than 1.8 (Hl 3 < 1.8) 

and/or that over groups of four < 2.5 , where x was calculated using Pinsoft 2 (Mimotopes 
Pty Ltd) and specifying the N-terminus as N-acetyl and the C-terminus as carboxy amide. 
Different cut-off values will be obtained with different hydrophobicity algorithms. 

Table 3 shows the HI values over 3 peptides (n=3) for 15 random arrangements of 
15 the 26 EBVCTL epitopes that were generated and analysed by calculating the HI. This 
shows that all 15 random configurations contain multiple HI values which are all in the 
range predicted to preclude the production of a recombinant polypeptide in R «?//(ie HI 
greater than 1.8). 



23 



w 





1.16 






ON 
|-H 


1.46 


1.95 


2.01 


t< 
l-H 


1.38 


2.41 


1.81 


1.69 


2.02 


2.22 


CM 


1.68 


1.69 


2.03 


3 



S 5§ 

t-H i-H 

3 S 



rH rH 
rH 

9 NO 
r-5 rH 

t-i r-i 

rH rH 

5? % 



rH *H 



rH vo 



rH c> 

a a 



2 2 
S 3 



e e 



3 3 

CN rH 

CM CN 
°° 

^ CN 

CN ^ 



3 

rH 

ON 

3 



CN 



a 

3 



53 



rH rH 



r-4 rH 



00 rH 
rH cN* 



rH rH 
< CQ 

I B 



26 



5t! ON Ok 

rn 



3 3. 22S3 2 3^s^ 

rt r 4 rH 



1 S 



9 

O 



<3 



2 



£s eo 

3 S 



3 

1 



2 



t 



2 



3 



3 

S3 
I 

a 



s 



i-3 



CP 

o 



§ 



S3 

i 

<3 



S 



i s 



vol 

sl 



8 



> 



s 



O 

8 



8 I 




§ 



u 
S 



9 



s 



s? 

o 



CO 
00 



oo cn 

CO CO 

o d 



6 



CO 

d 



3 

o 



3 

d 



ID 



VO 
CO 




2 



oo 
PQ 



CO 



■S 



PQ 



«n 

S3 S 3 



o 



# 



27 



58 



in 



o 




141358971 



28 

Fusions of thesepolyepitopes with the ^ternunal 400 anuno acids of a naturaUy^ccurring 
EBV protein (gp350). e 

Provision of CD4 help has previously been shown to improve CTL induction (Thuy 
et al, 2001) and the EBV structural protein g P 350 was identified as the preferred candidate 
to provide this property because it would provide cognate help. Hence the two 
polyepitopes, PT26A and PT26B, the latter of which was unable to be expressed in E coii 
were cloned onto the C-terminus of the N-tenninal 400 amino acids of g P 350 (Figure 3) and 
expressed as the fusion protein in R coii Both of these fused polypeptides were well- 
produced, being clearly visible on a Coomassie-stained gel above the background of E coii 
proteins (profile for FT26A shown in Figure 4B). 

CTL activity 

Vaccination of HLA A2 transgenic mice with either g P 350-PT26A-KCOM™ vaccine 
or g P 350-PT26B-ISCOM™ vaccine induced a CTL immune response (IFN ELISPOT) to all 
five A2 epitopes (Figure 5), thus indicating that all A2 epitopes were properly processed 
and presented to the immune system. 
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It will be appreciated by persons skilled in the art that numerous variatibns 
and/or modifications may be made to the invention as shown in the specific 
embodiments without departing from the spirit or scope of the invention as broadly 
described. The present embodiments are, therefore, to be considered in all respects as 
illustrative and not restrictive. 

Dated this 12th day of July 2002 



The Council of the Queensland Institute of Medical Research 

By their Patent Attorneys: 

Blake Dawson Waldron Patent Services 
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Figure 3A 
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